Learning to Drive in the NGSIM Simulator Using Proximal Policy Optimization
نویسندگان
چکیده
As a popular research field, autonomous driving may offer great benefits for human society. To achieve that, current studies often applied machine learning methods like reinforcement to enable an agent interact and learn in stimulating environment. However, most simulators lack realistic traffic which cause deficiency interaction. The present study adopted the SMARTS platform create simulator trajectories of vehicles NGSIM I-80 dataset were extracted as background traffic. built was used train model using proximal policy optimization method. actor-critic neural network applied, takes inputs including 38 features that encode information host vehicle nearest surrounding lane adjacent lane. A2C selected comparative results revealed PPO outperformed task by collecting more rewards, traveling longer distances, encountering less dangerous events during training testing. achieved 84% success rate test is comparable related studies. proved public can provide useful tool driving.
منابع مشابه
Proximal Policy Optimization Algorithms
We propose a new family of policy gradient methods for reinforcement learning, which alternate between sampling data through interaction with the environment, and optimizing a “surrogate” objective function using stochastic gradient ascent. Whereas standard policy gradient methods perform one gradient update per data sample, we propose a novel objective function that enables multiple epochs of ...
متن کاملwillingness to communicate in the iranian context: language learning orientation and social support
why some learners are willing to communicate in english, concurrently others are not, has been an intensive investigation in l2 education. willingness to communicate (wtc) proposed as initiating to communicate while given a choice has recently played a crucial role in l2 learning. it was hypothesized that wtc would be associated with language learning orientations (llos) as well as social suppo...
on the relationship between self- regulated learning strategies use and willingness to communicate in the context of writing
این تحقیق به منظور بررسی رابطه بین میزان استراتژیهای خود-تنظیم شده یادگیری و تمایل به ایجاد ارتباط دانشجویان زبان انگلیسی انجام شده است.علاوه بر این،روابط و کنش های موجود بین ریزسنجه های استراتژیهای خود-تنظیم شده یادگُیری ، مهارت نگارش و تمایل به برقراری ارتباط و همچنین تاٍثیرجنسیت دانشجویان زبان انگلیسی در استراتژیهای خود-تنظیم شده یادگیری و تمایل به برقراری ارتباط آنها مورد بررسی قرار گرفته شد.
15 صفحه اولthe relationship between using language learning strategies, learners’ optimism, educational status, duration of learning and demotivation
with the growth of more humanistic approaches towards teaching foreign languages, more emphasis has been put on learners’ feelings, emotions and individual differences. one of the issues in teaching and learning english as a foreign language is demotivation. the purpose of this study was to investigate the relationship between the components of language learning strategies, optimism, duration o...
15 صفحه اولthe relationship between emotional intelligence, willingness to communicate in the classroom and learners’ beliefs about language learning: iranian efl learners in focus
یکی از زمینه های تحقیقاتی در حال پیشرفت، هوش عاطفی است که دارای پتانسیل بالایی برای به کار گیری در سیستم آموزشی می باشد که اخیرامرکز توجه تحقیقات روانشناسی آموزشی قرار گرفته است. این مطالعه تلاش می کند تا به بررسی ارتباط بین هوش عاطفی ، تمایل به برقراری ارتباط در کلاس درس و نگرش زبان آموزان نسبت به یادگیری زبان بپردازد. به منظور رسیدن به این هدف تعداد138 زبان آموز به طور تصادفی انتخاب شدند. سپس...
ذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Advanced Transportation
سال: 2023
ISSN: ['0197-6729', '2042-3195']
DOI: https://doi.org/10.1155/2023/4127486